Why your model parameter confidences might be too optimistic – unbiased estimation of the inverse covariance matrix

نویسنده

  • P. Schneider
چکیده

Aims. The maximum-likelihood method is the standard approach to obtain model fits to observational data and the corresponding confidence regions. We investigate possible sources of bias in the log-likelihood function and its subsequent analysis, focusing on estimators of the inverse covariance matrix. Furthermore, we study under which circumstances the estimated covariance matrix is invertible. Methods. We perform Monte-Carlo simulations to investigate the behaviour of estimators for the inverse covariance matrix, depending on the number of independent data sets and the number of variables of the data vectors. Results. We find that the inverse of the maximum-likelihood estimator of the covariance is biased, the amount of bias depending on the ratio of the number of bins (data vector variables), p, to the number of data sets, n. This bias inevitably leads to an – in extreme cases catastrophic – underestimation of the size of confidence regions. We report on a method to remove this bias for the idealised case of Gaussian noise and statistically independent data vectors. Moreover, we demonstrate that marginalisation over parameters introduces a bias into the marginalised log-likelihood function. Measures of the sizes of confidence regions suffer from the same problem. Furthermore, we give an analytic proof for the fact that the estimated covariance matrix is singular if p > n.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum likelihood spatiotemporal EEG/MEG source analysis

EEG/MEG noise has an unequal variance and is correlated, both in space and in time. Noise variance may differ greatly between samples or sensors, and correlations between samples or sensors can be very high [1-4]. If these noise characteristics are neglected, then an EEG/MEG source analysis will yield unreliable results [e.g. 5, 6]. First, source parameter estimates will be inefficient. That is...

متن کامل

An estimator of the inverse covariance matrix and its application to ML parameter estimation in dynamical systems

An exact formula of the inverse covariance matrix of an autoregressive stochastic process is obtained using the Gohberg}Semencul explicit inverse of the Toeplitz matrix. This formula is used to build an estimator of the inverse covariance matrix of a stochastic process based on a single realization. In this paper, we show that this estimator can be conveniently applied to maximum likelihood par...

متن کامل

A Newton Root-Finding Algorithm For Estimating the Regularization Parameter For Solving Ill-Conditioned Least Squares Problems

We discuss the solution of numerically ill-posed overdetermined systems of equations using Tikhonov a-priori-based regularization. When the noise distribution on the measured data is available to appropriately weight the fidelity term, and the regularization is assumed to be weighted by inverse covariance information on the model parameters, the underlying cost functional becomes a random varia...

متن کامل

Comparing Different Marker Densities and Various Reference Populations Using Pedigree-Marker Best Linear Unbiased Prediction (BLUP) Model

In order to have successful application of genomic selection, reference population and marker density should be chosen properly. This study purpose was to investigate the accuracy of genomic estimated breeding values in terms of low (5K), intermediate (50K) and high (777K) densities in the simulated populations, when different scenarios were applied about the reference populations selecting. Af...

متن کامل

Regularization parameter estimation for underdetermined problems by the χ principle with application to 2D focusing gravity inversion

Abstract. The χ-principle generalizes the Morozov discrepancy principle to the augmented residual of the Tikhonov regularized least squares problem. For weighting of the data fidelity by a known Gaussian noise distribution on the measured data and, when the stabilizing, or regularization, term is considered to be weighted by unknown inverse covariance information on the model parameters, the mi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006